The Lyman-α (Lyα) emission line is the primary observational signature of star-forming
galaxies at the highest redshifts, and has enabled the compilation of large samples of
galaxies with which to study cosmic evolution. The resonant nature of the line, however,
means that Lyα photons scatter in the neutral interstellar medium of their host galaxies,
and their sensitivity to absorption by interstellar dust may therefore be enhanced greatly.
This implies that the Lyα luminosity may be significantly reduced, or even completely
suppressed. Hitherto, no unbiased empirical test of the escaping fraction (f_esc) of Lyα
photons has been performed at high redshifts. Here we report, from a blind narrowband
survey in Lyα and Hα, that the average f_esc from star-forming galaxies at redshift z = 2.2
is just 5 per cent. This implies that numerous conclusions based on Lyα-selected samples
will require upwards revision by an order of magnitude, and we provide a benchmark for this
revision. We demonstrate that almost 90 per cent of star-forming galaxies emit insufficient
Lyα to be detected by standard selection criteria. Both samples show an anti-correlation of
f_esc with dust content, and we show that Lyα- and Hα-selection recovers populations that
differ substantially in dust content and f_esc.

The hydrogen Lyα emission line, thanks to its high intrinsic luminosity (L_Lyα), high
equivalent width (W_Lyα) and very convenient rest-frame wavelength (1,216 Å), continues to
act as a staple tracer of distant star-forming galaxies. However, the Lyα transition is a
resonant one, causing photons to scatter in the neutral hydrogen component (H I) of the
interstellar medium, so that path lengths to escape may be greatly increased compared to
non-resonant radiation. Thus, depending on the content, distribution and kinematics of H I,
and the dust content, f_esc for a galaxy may fall anywhere within the range of 0 to 1
(refs 8-11). It follows that for the field of Lyα astrophysics to bloom (from using the
line to select high-z galaxies, to a physically meaningful diagnostic of star formation) a
detailed empirical examination of f_esc in cosmological galaxies is essential.

Estimating f_esc at high z is extremely challenging, because the requisite supporting data
are observationally expensive to obtain. By comparing star-formation rates (SFR) derived
from Lyα with those from the ultraviolet continuum, several studies of z = 2 and 3 Lyα
samples infer <f_esc> = 30-60%. However, this assumes that the ultraviolet continuum is
un-attenuated, and is strongly dependent on models of stellar evolution to provide the
respective SFR calibrations (ref. 13). Furthermore, this technique is only valid if star
formation has proceeded at a constant rate over the last ~100 Myr. More importantly,
restricting analysis to Lyα-selected samples neglects potential star-forming galaxies that
do not show Lyα emission. Comparing independently determined Lyα and ultraviolet luminosity
functions provides an alternative, but cosmic variance can easily introduce errors of a
factor of two (ref. 5), and the aforementioned uncertainties in SFR calibration remain.
Using theoretical galaxy formation models, lower values of f_esc = 2% (ref. 14) to 10%
(ref. 15) have been estimated at z = 3, but these methods suffer from the large number of
ad hoc parameter assumptions that enter the models. A significant step forward can be taken
if Lyα is compared with another, non-resonant hydrogen recombination line (for example, Hα),
since both intrinsic strengths are a direct function of the ionizing luminosity. This has
been done at z = 0.3, placing f_esc = 1-2% (ref. 16), but cosmological application is
restricted by the >7 billion years over which galaxies can evolve to z > 2.

With a new, very deep survey using the ESO Very Large Telescope (VLT), we have overcome
all of these issues simultaneously. We have performed a blind, unbiased, narrowband imaging
survey for Hα and Lyα emission at z = 2.2 using custom-manufactured filters to guarantee
that the same cosmic volume is probed in both emission lines (Supplementary Information).
Thus, although cosmic variance does affect the number of objects in our survey volume, its
effect cancels from any volumetric properties we derive by comparison of the two samples.
With observational limits sensitive to unobscured SFRs of 1.9 solar masses per year
(1.9 M⊙ yr⁻¹) in Hα, we identify 55 new Hα emitters (ref. 17). Lyα observations are
sensitive to SFR = 0.26 M⊙ yr⁻¹ assuming f_esc = 1 (f_esc = 0.13 for the faintest Hα
emitters), and we identify 38 new galaxies. Lyα and Hα luminosities are shown in Fig. 1
(see also Supplementary Information). Targeting the GOODS-South field, we benefit from some
of the deepest broadband optical and infrared data in existence, compiled into public
source catalogues. From these we obtain stellar spectral energy distributions (SEDs), which
allow us to estimate dust content (E_B-V) and intrinsic SFR. We find all the Hα galaxies
and 21 of the Lyα emitters in public catalogues.

From the Lyα sample we construct the observed Lyα luminosity function, LF(Lyα). Using the
same formalism, we derive the first intrinsic Lyα luminosity function, LF(Lyα_0), from the
Hα sample, using measurements of E_B-V to correct the Hα luminosity for extinction, and
assuming the standard Lyα/Hα line ratio of 8.7 for ionization bounded nebulae ("case B";
see Fig. 2). In this way, we obtain the intrinsic and observed Lyα luminosity densities,
the ratio of which leads us directly to a volumetric f_esc = (5.3 ± 3.8)%, with no
dependence on cosmic variance, the evolutionary state of the galaxies, or calibration
uncertainties. The method is sensitive to the dust correction, as we assume that the same
extinction applies to the continuum and to Hα, and thus we perform the same test without
correcting Hα luminosities for dust, finding the most conservative upper limit of
(10.7 ± 2.8)%, a limit free of any model dependency whatsoever. This first key result shows
that commonly practised survey estimates of the total Lyα luminosity density at z > 2 will
significantly underestimate its intrinsic value: on average, only 1 in 20 of the intrinsic
Lyα photons is accounted for. This may not be surprising at the highest redshifts (that is,
above z = 6) where an increase in the neutral fraction of the intergalactic medium may
cause significant suppression of the Lyα line, but at z = 2-4 this effect is likely to be
small, and the photons must be lost in the interstellar medium of individual galaxies. This
result is especially consequential, because the integrated luminosity density converts
directly to the cosmic rate of star formation, implying that pure Lyα-based estimates of
volumetric SFR are in need of strong upward revision.

To investigate the origin of this underestimate, we examine the individual galaxies. Having
modelled the SEDs for all the objects found in the broadband catalogues, we obtain
homogeneous derivations of dust extinction (E_B-V) and SFR for both samples. From the Lyα
and Hα luminosities, E_B-V estimates and recombination theory, we compute f_esc in
individual sources (limits on f_esc are derived for sources detected in only one line by
assigning the 1σ limiting flux to non-detections). In Fig. 3 we show how f_esc correlates
with E_B-V for our observed galaxies, where we also plot the position of 50,000 synthetic
galaxies generated using the 'MCLya' radiation transfer code, and the f_esc-E_B-V
relationship expected from pure dust attenuation. All the synthetic galaxies fall below the
theoretical curve, and every observed galaxy except for one lies within 1σ of this region.
This demonstrates how Lyα photons are preferentially absorbed in the interstellar medium of
almost all of our galaxies.
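
The per-galaxy estimate described above can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the function name and the example luminosities are hypothetical, while the case B ratio of 8.7 and the Hα extinction coefficient of 3.33 are the values quoted in the text.

```python
CASE_B_LYA_HA = 8.7  # case B Lyα/Hα ratio quoted in the text
K_HA = 3.33          # extinction coefficient at 6563 Å quoted in the text


def lya_escape_fraction(l_lya_obs, l_ha_obs, ebv, k_ha=K_HA, ratio=CASE_B_LYA_HA):
    """Observed Lyα luminosity divided by the intrinsic Lyα predicted from Hα."""
    # Undo the dust attenuation of Hα: L_int = L_obs * 10^(+0.4 * k * E_B-V).
    l_ha_int = l_ha_obs * 10 ** (0.4 * k_ha * ebv)
    # Scale the intrinsic Hα luminosity to intrinsic Lyα with the case B ratio.
    l_lya_int = ratio * l_ha_int
    return l_lya_obs / l_lya_int


# Hypothetical galaxy (luminosities in erg/s): dust-free and exactly on the case B line.
print(lya_escape_fraction(8.7e41, 1.0e41, ebv=0.0))
# Same Hα luminosity, fainter Lyα and some dust: the escape fraction drops.
print(lya_escape_fraction(1.0e41, 1.0e41, ebv=0.23))
```

For a source detected in only one line, the same function would be called with the limiting luminosity in place of the missing measurement, which turns the result into a limit rather than a value.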

Despite the nonlinearity introduced by the multi-parametric Lyα transfer problem, f_esc and
E_B-V remain clearly anti-correlated. The correlation shows a gradient that is 50% steeper
than that predicted by pure dust attenuation: the effective extinction coefficient, k_1216,
is found to be 17.8 instead of 12 for normal attenuation. More striking in Fig. 3 is the
lack of overlap and significant offset between the populations: the Lyα and Hα samples are
almost disjoint in both quantities. We measure median values of E_B-V = 0.085 (0.23) and
f_esc > 0.32 (<0.035) for Lyα (Hα) emitters. Furthermore, we find the median SFRs to be
very different between the two samples: 3.5 M⊙ yr⁻¹ for Lyα, and 10.0 M⊙ yr⁻¹ for Hα. Thus
Lyα galaxies are significantly less powerful in forming stars, less dusty, and show higher
f_esc than Hα galaxies. For Lyα, these estimates are based on the 21 brighter galaxies
found in public source catalogues, and including the remainder would be likely to increase
the disparity between the samples. In f_esc the difference is further accentuated by the
fact that we are considering lower and upper limits for the Lyα and Hα samples,
respectively.

Another significant result is that of 55 Hα and 38 Lyα emitters, we detect only 6 galaxies
in both lines. These galaxies straddle the individual distributions, with 5 of the 6
falling within 1σ of the dust attenuation curve. This is unsurprising when examined in
light of the individual samples, but it is unlikely that such a relationship would have
been predicted on the basis of the z ≈ 0 objects of similar luminosity, where little
obvious correlation is found between the two line intensities. The fact that so few Hα
emitters are detected in Lyα can be attributed to a combination of two factors. First, the
extinction coefficient at Lyα is substantially larger than at Hα (k_1216 = 12.0 compared to
k_6563 = 3.33): the median E_B-V for the Hα emitters corresponds to a 50% reduction in Hα
luminosity but an f_esc value of just 7%. Second, the fact that only Lyα scatters serves to
exacerbate this, and the grey points in Fig. 3 show how f_esc can be reduced to below 1%,
even with minuscule dust contents. Indeed, for constant star formation (after the
equilibrium time of ~100 Myr) with a 'standard' initial mass function and metallicity,
W_Lyα is ~80 Å (refs 6, 7), and preferential suppression of Lyα by just a factor of 4 would
render a galaxy undetected in the survey. Short-lived burst scenarios increase W_Lyα to
>200 Å (refs 6, 7), requiring preferential attenuation factors of ~10; these are still
easily attainable at very modest E_B-V (ref. 11). Similarly, the low number of Lyα sources
detected in Hα is explained by the large range of escape fractions exhibited by
star-forming galaxies: Lyα selection preferentially finds galaxies with higher f_esc values
and smaller attenuation in Hα, resulting in line ratios nearer the recombination value and
comparatively faint Hα. This pushes the Hα fluxes below our detection limit for most
galaxies, despite the very deep Hα data.
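
The arithmetic behind the "50% in Hα but 7% in Lyα" statement can be checked with a few lines. This sketch assumes pure dust screening of the form 10^(-0.4 k E_B-V), as used for the attenuation curve in Fig. 3; the function name is ours, and the coefficients and median E_B-V are the values quoted in the text.

```python
def transmitted_fraction(ebv, k):
    """Fraction of line flux surviving pure dust attenuation: 10^(-0.4 * k * E_B-V)."""
    return 10 ** (-0.4 * k * ebv)


median_ebv_ha = 0.23  # median E_B-V of the Hα emitters
ha = transmitted_fraction(median_ebv_ha, k=3.33)    # Hα, k_6563
lya = transmitted_fraction(median_ebv_ha, k=12.0)   # Lyα, k_1216 without scattering
print(f"Hα transmitted: {ha:.2f}, Lyα transmitted: {lya:.3f}")
```

The result is roughly one half for Hα and somewhat under 8% for Lyα. Any additional suppression from resonant scattering, which this toy calculation ignores, would lower the Lyα value further.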

Increasing the number of coincident detections is extremely challenging observationally.
Owing to the wide range of relative line intensities, a large range of luminosities needs
to be spanned in both lines, requiring each observation to be both wide and deep. This is
currently feasible in Lyα, but large (~0.5 degree²) infrared imagers are still non-existent
on telescopes of the 8-10-m class. Extending this survey to higher redshift will remain
unfeasible until the James Webb Space Telescope comes online.

Figure 1. Observed Hα and Lyα luminosities. Sources detected only in Hα are shown in red,
those detected only in Lyα in blue, and common detections in green. All error bars are 1σ
photometric uncertainties. Objects undetected in Lyα or Hα are represented as upper limits
placed at the detection limits of the Hα or Lyα data, respectively (black dashed lines).
The dashed magenta line shows the Lyα/Hα ratio for case B recombination in the absence of
dust, and the dotted line shows Lyα = Hα. For a sample of dust-free galaxies, complete in
both lines, all objects should line up on the recombination line, with dust and the effects
of radiation transfer serving only to move objects away from the line in the direction of
L_Lyα < 8.7 L_Hα. The dashed cyan line shows 8.7 times the 5σ detection limit for Hα
applied to the L_Lyα axis. For the case B recombination ratio, all galaxies falling above
this line should be detected in Hα (see Supplementary Information for more details). No
objects occupy this region of the diagram with significance above 1σ.




Figure 2. Lyα luminosity functions. Φ is the number density of galaxies per decade in
luminosity. The SFR labelled on the upper abscissa corresponds directly to the luminosity
on the lower. The luminosity function shaded blue, at lower luminosity, shows the observed
luminosity distribution, derived from the VLT/FORS1 observations. The function shaded cyan,
at higher luminosity, shows the intrinsic luminosity function, denoted LF(Lyα_0), derived
from the HAWK-I observations by correcting the Hα luminosities for dust attenuation, and
multiplying by the case B Lyα/Hα ratio of 8.7. Black open circles show the bins of the
respective luminosity functions, with vertical error bars representing 68% confidence
limits. For the observed LF(Lyα), this error is derived from Poisson statistics and
incompleteness simulations alone. For the intrinsic LF(Lyα), the error bar also includes
the error on the dust correction, which is randomized on every realization of the Monte
Carlo simulation, allowing galaxies to jump between adjacent bins (see Supplementary
Information for details). The shaded regions associated with each luminosity function
represent the regions of 68% confidence derived from the Monte Carlo. For each realization,
both intrinsic and observed LF(Lyα) are regenerated and fitted with the Schechter function;
integration over luminosity between 0 and infinity then provides us with the observed and
intrinsic Lyα luminosity densities. Volumetric f_esc follows directly as the ratio of these
two quantities, and is found to be (5.3 ± 3.8)%. Scaling the 68% limits of the intrinsic
LF(Lyα) by this fraction in luminosity results in the dashed black lines, which clearly and
comfortably encompass the observed distribution.

Figure 3. Escape fraction (f_esc) and dust attenuation (E_B-V). All objects found in the
broadband photometry catalogue (that is, for which we can recover E_B-V) are included.
Green shows galaxies detected in both lines, blue shows detections only in Lyα, and red
shows detections only in Hα. All error bars are derived from propagation of measurement and
model fit uncertainties, and represent 68% confidence. The grey clouds show the positions
of 50,000 synthetic galaxies produced using the MCLya radiation transfer code, and are
labelled R.T. models. The black line shows the dust attenuation law of Calzetti, which
should be valid in the absence of resonance scattering. The magenta line shows the relation
that best fits the observed data points using Schmitt's binned linear regression algorithm,
a survival analysis algorithm able to account for data points and limits in both directions
which does not require a priori knowledge of the distribution of the censored parent
population. The number of data points is 67 (55 Hα emitters + 18 Lyα emitters - 6 common
detections). Parameterized as f_esc = 10^(-0.4 × E_B-V × k_1216), we find the extinction
coefficient k_1216 to be 50% higher (k_1216 = 17.8 instead of 12.0) than the curve of pure
dust attenuation. All but a few data points fall in the region swept out by the radiation
transfer code, and demonstrate the significance of the spatial and kinematic structure of
the neutral interstellar medium in the transfer of photons. Furthermore it is clear that
the areas of the f_esc-E_B-V diagram populated by Lyα- and Hα-selected galaxies are almost
disjoint: the Lyα sample is significantly less dusty and exhibits higher escape fractions
than the Hα sample. This clearly shows how the populations recovered by the respective
selection functions are very different, despite the fact that the physics governing the
production of the two emission lines is identical.



Supplementary Information 



Survey redshift matching and the custom bandpass 

Critical for this survey is the matching of the cosmic volumes probed by the Lyα and Hα legs. In right
ascension and declination this is easily done thanks to the very similar fields-of-view of the HAWK-I
and FORS1 cameras, although the redshift dimension is more challenging and required the
manufacture of a custom narrowband filter. For Hα we adopted the HAWK-I/NB2090 bandpass
(λc = 2.095 µm; Δλ = 0.019 µm) and obtained the overall system throughput curve from the ESO HAWK-I
Instrument Support Team. We shifted this bandpass in wavelength to the domain required to sample Lyα
from the same Δz as NB2090 samples Hα: effectively multiplying the wavelength axis of the filter by
1216/6563. We procured a 120 mm diameter filter with an extremely similar bandpass for the blue-optimised
FORS1 instrument, the normalised system throughput of which, together with the shifted
NB2090 filter, is shown in Figure SI1. These two filters are clearly an ideal match to sample this slice in
redshift. The Lyα filter (λc = 3880 Å; Δλ = 37.0 Å) is designated NB388 by ESO and throughout this manuscript.
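
The bandpass shift described above amounts to a single multiplicative rescaling of the wavelength axis. The sketch below is illustrative only: the function names are ours, and the NB2090 centre and width are taken from the text (2.095 µm, and the 191 Å full width at half maximum quoted in the next section).

```python
LYA_REST = 1216.0  # Å, rest wavelength of Lyα
HA_REST = 6563.0   # Å, rest wavelength of Hα


def shift_to_lya(wavelength_ha):
    """Map a wavelength on the Hα filter curve to the matching Lyα wavelength."""
    return wavelength_ha * LYA_REST / HA_REST


def redshift(observed, rest):
    """Redshift at which a line of rest wavelength `rest` lands at `observed`."""
    return observed / rest - 1.0


nb2090_centre = 20950.0  # Å (2.095 µm)
nb2090_fwhm = 191.0      # Å

print(f"ideal Lyα centre: {shift_to_lya(nb2090_centre):.0f} Å")
print(f"ideal Lyα width:  {shift_to_lya(nb2090_fwhm):.1f} Å")
print(f"Hα redshift in NB2090: {redshift(nb2090_centre, HA_REST):.3f}")
print(f"Lyα redshift in NB388: {redshift(3880.0, LYA_REST):.3f}")
```

Both filters map to a redshift close to 2.19, which is the sense in which the two bandpasses sample the same slice.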

Observations and data reduction 

The targeted field lies in the GOODS-South field, centred at α = 03h32m32.88s; δ = -27°47′16″. For the
7.5′ × 7.5′ field-of-view of HAWK-I, the 191 Å full width at half maximum of NB2090, and a
cosmology of H0 = 70 km s⁻¹ Mpc⁻¹, Ω_Λ = 0.7, Ω_M = 0.3, this corresponds to a survey volume of 5440 Mpc³.
Selection of the GOODS-S field provides us with a wealth of auxiliary data, including the Chandra X-ray
Observatory, Hubble Space Telescope (HST) observations in BViz, ESO Imaging Survey data in
UBVRIJHKs, and Spitzer Space Telescope data at 3.6, 4.5, 5.8, 8.0, and 24 µm. Furthermore our pointing
adopts a position angle of -44° and completely encompasses the Hubble Ultra Deep Field, giving us the
deepest BVizJH data in existence for around one eighth of our survey area.
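
The quoted survey volume can be reproduced approximately from the field size, filter width and cosmology given above. The following is a rough sketch under stated assumptions: the filter is treated as a top-hat of width equal to its full width at half maximum, the field is treated as a small square on the sky, and the comoving distance is integrated with a simple trapezoid rule. Function names are ours.

```python
import math

C_KM_S = 299792.458                     # speed of light, km/s
H0, OMEGA_M, OMEGA_L = 70.0, 0.3, 0.7   # cosmology quoted in the text (flat)


def inv_e(z):
    """1 / E(z) for a flat matter + Lambda universe."""
    return 1.0 / math.sqrt(OMEGA_M * (1.0 + z) ** 3 + OMEGA_L)


def comoving_distance(z, steps=2000):
    """Line-of-sight comoving distance in Mpc, via the trapezoid rule."""
    h = z / steps
    total = 0.5 * (inv_e(0.0) + inv_e(z))
    for i in range(1, steps):
        total += inv_e(i * h)
    return (C_KM_S / H0) * total * h


def survey_volume(side_arcmin, lam_centre, fwhm, lam_rest):
    """Comoving volume (Mpc^3) of a square field seen through a top-hat filter."""
    z_lo = (lam_centre - fwhm / 2.0) / lam_rest - 1.0
    z_hi = (lam_centre + fwhm / 2.0) / lam_rest - 1.0
    side_rad = math.radians(side_arcmin / 60.0)
    solid_angle = side_rad ** 2  # steradians, small-angle approximation
    d_lo, d_hi = comoving_distance(z_lo), comoving_distance(z_hi)
    return solid_angle / 3.0 * (d_hi ** 3 - d_lo ** 3)


volume = survey_volume(7.5, 20950.0, 191.0, 6563.0)
print(f"{volume:.0f} Mpc^3")
```

Under these assumptions the result lands within a few per cent of the 5440 Mpc³ quoted in the text; the true filter profile is not a top-hat, so exact agreement is not expected.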

Details of the HAWK-I observations, data reduction, target selection, and luminosity function (LF) have
already been presented. Briefly, using NB2090 and imaging, we identify 152 narrowband-excess
candidates with equivalent widths above 63.8 Å (i.e. 20 Å in the restframe). Using the GOODS-MUSIC
broadband catalogue, we confirm 55 of these to be Hα-emitting galaxies at z~2.2 based upon
spectroscopic and photometric redshifts, and BzK colour criteria. The limiting depth of these narrowband
data is AB = 24.6 (5σ), which corresponds to a line flux of 6.8 × 10⁻¹⁸ erg s⁻¹ cm⁻². In turn this corresponds
to an unobscured star-formation rate of 1.9 M⊙ yr⁻¹, assuming a Salpeter IMF and solar metallicity.
From analysis of the equivalent width distribution of our z = 2.2 galaxies we determine that our selection
criterion of a minimum equivalent width causes us to underestimate the total Hα luminosity density by
between 1 and 16%. We construct LF(Hα), which we find to be well fitted by a Schechter function with the
parameters log(L*/erg s⁻¹) = 43.22, log(φ*/Mpc⁻³) = -3.96, and α = -1.72.
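
The relation between the observed and restframe equivalent width thresholds quoted above is a division by (1 + z). A minimal check, with a function name of our choosing and z = 2.19 taken from the SED-fitting section:

```python
def rest_frame_ew(observed_ew, z):
    """Convert an observed equivalent width (Å) to the restframe value."""
    return observed_ew / (1.0 + z)


print(round(rest_frame_ew(63.8, 2.19), 1))
```

The same factor applies in reverse to the restframe 20 Å threshold used for the Lyα selection.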

Using VLT/FORS1 we obtained very deep narrowband imaging data over an almost identical area using
the custom NB388 filter, and also in the continuum using the high throughput U and B broadbands. Data
were obtained in visitor mode during dark time (less than 3 days from new moon and with the moon
below the horizon for the duration of observations) over the consecutive nights beginning 23
to 27 December 2008. Table 1 shows the characteristics of the three filters, the total integration times,
and number of dithered pointings. Nights beginning 23, 25 and 26 were photometric throughout the
duration of the observations. Seeing during the observing run was typically good, never exceeding 1.2".



Table 1. FORS1 observations

Filter   λc [Å]   FWHM [Å]   # Exposures   Exp. time [s]   m_lim 5σ [AB]
NB388    3880     37         36            60,900          26.4
U        3606     513        6             3600            26.4
B        4397     1030       9             3490            26.75

Table note: In the U- and B-bands, these exposure times and number of pointings correspond to the
observations obtained with VLT/FORS1 during this observing run. The limiting magnitudes, however, are
derived from the VLT/FORS1 images stacked with the data obtained from the ESO Imaging Survey, and
thus are significantly deeper than could be obtained in the quoted integration times. The FORS1
observations were obtained in order to deepen, and better homogenise, the dataset.

Data were reduced using standard tasks in NOAO/IRAF: bias subtraction, flat-field correction, and sky
subtraction were performed. Images were then registered onto a common astrometric grid and co-added.
U and B images were stacked together with the ESO Imaging Survey U and B images. The astronomical
seeing in the final NB388 frame was 0.84". The NB388 frame was calibrated using standard star GD50,
observed at various airmasses during photometric time. In the final reduced frames we estimate
incompleteness with the ARTDATA package in NOAO/IRAF. We model the point-spread function (PSF)
of our final images and insert artificial sources at random positions within the image. We test their
recovery using the source extraction and photometry software described in the following section (using
the same configuration). Limiting magnitudes in the various images are also listed in Table 1. The 5σ
NB388 limiting magnitude corresponds to a line flux of 7.8 × 10⁻¹⁸ erg s⁻¹ cm⁻². For our faintest Hα fluxes,
this represents a Lyα escape fraction of 13%.
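
The 13% figure follows from comparing the two line-flux limits through the case B ratio. The sketch below is an illustration with a function name of our own; it assumes flux limits of 7.8 × 10⁻¹⁸ (Lyα) and 6.8 × 10⁻¹⁸ erg s⁻¹ cm⁻² (Hα) as read from the text, and applies no dust correction to Hα.

```python
CASE_B_LYA_HA = 8.7  # case B Lyα/Hα ratio used throughout the text


def min_detectable_fesc(lya_flux_limit, ha_flux, ratio=CASE_B_LYA_HA):
    """Smallest Lyα escape fraction that still yields a Lyα detection."""
    # Intrinsic Lyα flux expected from the Hα flux under case B recombination.
    lya_intrinsic = ratio * ha_flux
    return lya_flux_limit / lya_intrinsic


# Faintest detectable Hα emitter compared with the NB388 depth (erg/s/cm^2).
f_min = min_detectable_fesc(7.8e-18, 6.8e-18)
print(f"{100 * f_min:.0f}%")
```

Brighter Hα emitters can be recovered in Lyα at correspondingly lower escape fractions, since the required fraction scales inversely with the Hα flux.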

Photometry, selection, and catalogue assembly 

We perform all source detection and photometry with SExtractor. Source detection is done in the NB388
image, requiring a minimum of 5 contiguous pixels and a threshold signal-to-noise (S/N) of 5 above
the local background. Photometry is done in the combined U and B-band images using SExtractor in
'double-image' mode, ensuring that apertures are matched between the respective frames. We adopt the
MAG_AUTO method for estimating magnitudes. From the two broadband catalogues we perform a
power-law interpolation to estimate an effective magnitude (referred to as UB) at the wavelength of the
NB388 filter. We define our selection based upon observed equivalent width, which translates directly
into UB-NB388 colour. To make our selection directly comparable to previous narrowband Lyα
surveys we require a restframe W_Lyα > 20 Å. We show the colour-magnitude plot for all the
detected sources and the selection of candidates in Figure SI2.

We find 38 galaxies that match this criterion, of which six are also found to be Hα emitters. In Figure 1 of
the main article we plot the observed luminosities in Hα and Lyα of all the candidates. Objects undetected
in either of the lines are assigned a luminosity based on the 5σ detection limit of the appropriate
narrowband observation. The dashed magenta line shows the line ratio expected from case B
recombination and none of the objects are found to lie above this line with a significance greater than 1σ.
The cyan line shows the 5σ Hα detection limit scaled by the case B line ratio for Lyα/Hα: the luminosity
above which all Lyα-selected objects should be detected in Hα. Two of the 32 candidate Lyα emitters
undetected in Hα lie above this line. This can be interpreted as either the presence of a clumpy ISM or
a non-thermal contribution to the production of Lyα (e.g. winds or cooling radiation), although it is
important to stress that both objects are consistent with the limit within 1σ photometric error, and most
likely are placed there by scatter.

We cross-correlate all of our candidates with the GOODS-MUSIC catalogue in order to obtain
broadband magnitudes (or limits in the case of non-detections). 21 of the objects are found in the
catalogue. Four of the GOODS-MUSIC objects have measured spectroscopic redshifts (spec-z), of which
two are lower redshift interlopers and two are confirmed Lyα emitters with redshifts consistent with the
bandpass. For all the candidates (including the four with spec-z's), we measure their photometric redshifts
(phot-z) with the Hyper-z code, both in its default state and modified with the inclusion of nebular
emission lines (see also the following section). Note that for aperture and methodological consistency,
we use as input to Hyper-z only the magnitudes published in the GOODS-MUSIC catalogue; we never
mix GOODS-MUSIC photometry with our own. Phot-z tests naturally confirm that the two
spectroscopically confirmed interlopers are indeed interlopers, and that the two z = 2.2 Lyα emitters are
indeed at z = 2.2. Of the 17 remaining galaxies, we find phot-z's consistent with z = 2.2 in all cases apart
from one; this object is removed from the sample. For a narrowband filter centred at λc = 3880 Å there are
very few emission lines that could possibly contaminate the Lyα sample: only [OII] λ3727 Å at z = 0.04
(a comparatively tiny cosmic volume) and the CIV λλ1548,1550 Å doublet at z = 1.5 from active galactic
nuclei (AGN). Thus it is not surprising that so few interlopers are found in the sample. In order to identify
any objects powered by non-thermal nuclear accretion, we cross-correlate our sample with the 1 Megasecond
Chandra X-ray catalogue but find no X-ray detections within 2.5" of any of our Lyα
candidates. We thus define an interloper rate of 3/21 = 14%. We use this quantity later when selecting
Lyα candidates not found in the GOODS-MUSIC catalogue to include in the derivation of the Lyα
luminosity function.

Finally, we have SEDs for 55 Hα emitters and 18 Lyα emitters. Since six of these galaxies are found in
both emission lines we have a total of 55+18-6=67 SEDs.

SED fitting, dust extinction, and star-formation rates



For the 67 objects with SEDs we are able to perform detailed fits of the spectral energy distribution using
the Hyper-z code. We fix the redshift to 2.19, and use synthetic stellar evolutionary models assuming
exponentially decreasing rates of star formation, SFR(t) = SFR_0 e^(-t/τ), with τ in the range 0 (simple stellar
population) to ∞ (constant star formation) and approximately logarithmically spaced between 5 and 3000
Myr. We assume a Salpeter'^ initial mass function and metallicity Z=\l2iZ^. Hyper-z employs a standard 
χ² minimiser to cover the full parameter space by brute force, and we fit overall normalisation, stellar age,
star-formation history and dust extinction using the Calzetti prescription, saving the covariance matrix.
We derive confidence limits at 1σ from analysis of the χ² distribution, searching and interpolating across
the full 4-dimensional parameter space and estimating the probability distribution function collapsed onto
each dimension. Thus we are able to obtain confidence limits on our best fitting parameters. For this
Letter, however, the only properties made use of are E_B-V and star-formation rate. Note that it is this
stellar estimate of E_B-V that we use to estimate the reddening undergone by nebular photons (Hα), and
apply in our estimate of the intrinsic Lyα luminosity. Studies of star-forming galaxies (8 objects) in the
nearby universe have found a nebular E_B-V measured from the Balmer decrement to systematically lie a
factor of 2 higher than E_B-V measured from the stellar continuum. However, application of the Calzetti
law for dust attenuation should remove this discrepancy, as shown by observations of star-forming
galaxies at z~2 (114 objects) which show no such offset, instead finding a tight one-to-one correlation
between dust-corrected SFR(UV) and SFR(Hα).

We compute some average properties for the respective samples of Lyα and Hα emitters: E_B-V, f_esc, and
SFR. These quantities are found to be rather different, with Lyα emitters less dusty, less star forming, and
with higher f_esc than Hα galaxies. The Hα sample is complete in detections in the GOODS-MUSIC
catalogue but the Lyα sample is not (21/38), and thus the average values of E_B-V and f_esc may be biased with
respect to the complete sample. However, it is known for continuum-selected samples, and also
for Hα, that extinction in general increases with increasing star-formation rate or luminosity. Thus the
fainter, undetected Lyα emitters are likely to exhibit lower dust extinctions than those from which the
sample-averaged values are calculated, increasing the disparity between the average values computed for
the Lyα and Hα samples. By association, and in accordance with the trends presented in Figure 3, this
would also likely manifest itself as an increase in f_esc for the Lyα-selected sample, increasing the
separation between the samples in f_esc.

Luminosity functions, Monte Carlo simulations, and escape fractions 

We estimate the intrinsic LF(Lyα) by firstly taking the raw luminosities and E_B-V estimates for Hα
emitting galaxies and correcting for dust attenuation. For reference, our raw Hα luminosity function
shows excellent agreement with previous measurements at z~2 (ref. 49). We then multiply each intrinsic
L_Hα by the case B Lyα/Hα ratio of 8.7 (ref. 20) to obtain the intrinsic L_Lyα for the individual objects. We bin
the objects by luminosity in 4 bins to create the "observed intrinsic" LF(Lyα), shown in cyan in Figure 2.
Errors on each bin are derived from Poisson statistics. Using the mean E_B-V per bin and the factor of 8.7
again, we convert our intrinsic LF(Lyα) bins back to bins defined by observed magnitudes and, again
using the ARTDATA package, simulate incompleteness of the HAWK-I NB2090 data. Incompleteness is
propagated back into the intrinsic LF(Lyα) and its error bars. To this we then fit the Schechter function
using a standard minimiser and integrate over luminosity between 0 and infinity to obtain the intrinsic
Lyα luminosity density, ρ_int(L_Lyα), for our z = 2.2 volume.
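
Integrating a Schechter function over all luminosities has a closed form, which is what makes the step above cheap to repeat. The sketch below assumes the common parameterisation φ(L) dL = φ* (L/L*)^α exp(-L/L*) d(L/L*), for which the luminosity density is φ* L* Γ(α + 2); whether φ* in the text is normalised this way or per decade in luminosity is not stated here, so absolute values are illustrative. The function name is ours, and the example parameters are those quoted earlier for the raw Hα luminosity function.

```python
import math


def schechter_luminosity_density(log_l_star, log_phi_star, alpha):
    """Integral of L * phi(L) dL from 0 to infinity: phi* L* Gamma(alpha + 2)."""
    if alpha <= -2.0:
        # The faint end contributes infinite luminosity for slopes this steep.
        raise ValueError("luminosity density diverges for alpha <= -2")
    return 10.0 ** (log_l_star + log_phi_star) * math.gamma(alpha + 2.0)


# Raw Hα parameters from the text: log L* = 43.22, log phi* = -3.96, alpha = -1.72.
rho_ha = schechter_luminosity_density(43.22, -3.96, -1.72)
print(f"log10(rho_Ha / erg s^-1 Mpc^-3) = {math.log10(rho_ha):.2f}")
```

With the fitted Lyα parameters for the observed and intrinsic functions, the volumetric escape fraction is the ratio of two such calls.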

We assemble the observed LF(Lyα) in a similar manner by first including all 18 Lyα emitters
with redshifts consistent with z = 2.19. From the 17 candidates with continuum too faint to be
found in the GOODS-MUSIC catalogue, we randomly select galaxies with an 86% chance of inclusion
(i.e. based upon the 14% interloper rate). All selected objects are binned in observed L_Lyα, and
incompleteness is simulated in the FORS1 NB388 frame using ARTDATA as described previously. We
again fit a Schechter function and integrate over luminosity to obtain the observed Lyα luminosity
density, ρ_obs(L_Lyα). For reference, our observed LF(Lyα) is in reasonable agreement with the z = 3.1 LF,
although it does find L* around one magnitude brighter at the 1σ confidence level. The observed LF(Lyα) is
shown in blue in Figure 2. Note again that while our survey volume is small and cosmic variance may be
significant, the volume is perfectly matched between the Hα and Lyα legs of the survey, and cosmic
variance divides away from results based on the differential comparison of the two LFs.

We present the Schechter parameters for both the observed and intrinsic LF(Lya) in Table 2. With
measurements of the observed and intrinsic Lya luminosity densities, we define the volumetric Lya
escape fraction as ρ_obs(L_Lya) / ρ_int(L_Lya). From the errors on L_Ha and E_B-V for the Ha sample, and on
L_Lya only for the Lya sample, we randomly regenerate the catalogues and Lya luminosity functions, and
re-perform the above procedure over a 1,000 realisation Monte Carlo simulation. Objects among the 17
candidates not found in the GOODS-MUSIC catalogue are randomly re-drawn on every realisation based upon
the z=2.2 confirmation rate of the objects that were. This simulation yields a volumetric escape fraction of
(5.3±3.8)%. Over each realisation we also perform the same calculation without the inclusion of the
reddening correction on Ha as a security check to find the absolute maximum volumetric escape fraction.
Here we simply calculate ρ_obs(L_Lya) / [ 8.7 × ρ_int(L_Ha) ], with the Ha luminosities left uncorrected
for reddening. This places the most stringent upper limit on the volumetric f_esc(Lya) of (10.7±2.8)% and,
while higher than our primary method by a factor of 2, this derivation contains not a single model dependency.
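
The resampling logic of the Monte Carlo procedure can be sketched as below. This is a simplified stand-in, not the procedure used here: it replaces the completeness simulation and Schechter fit with a direct sum of luminosities (the matched volume cancels in the ratio), assumes Gaussian errors, and uses an illustrative extinction-curve value K_HA. All function and variable names are hypothetical.

```python
import random
import statistics

CASE_B_LYA_HA = 8.7   # case B Lya/Ha ratio from the text
CONFIRM_RATE = 0.86   # inclusion probability for unconfirmed candidates
K_HA = 3.33           # ASSUMED extinction-curve value at Ha (illustrative)


def one_realisation(rng, lya_confirmed, lya_candidates, ha_sample):
    """Return the observed/intrinsic Lya ratio for one perturbed catalogue.

    lya_* entries are (L_Lya, sigma_L); ha_sample entries are
    (L_Ha, sigma_L, E_BV, sigma_EBV). Sums stand in for the LF integrals.
    """
    observed = sum(max(rng.gauss(lum, err), 0.0) for lum, err in lya_confirmed)
    for lum, err in lya_candidates:
        # Re-draw the membership of each unconfirmed candidate every time.
        if rng.random() < CONFIRM_RATE:
            observed += max(rng.gauss(lum, err), 0.0)

    intrinsic = 0.0
    for lum, err, ebv, ebv_err in ha_sample:
        l_ha = max(rng.gauss(lum, err), 0.0)      # perturbed Ha luminosity
        e_bv = max(rng.gauss(ebv, ebv_err), 0.0)  # perturbed colour excess
        intrinsic += CASE_B_LYA_HA * l_ha * 10.0 ** (0.4 * K_HA * e_bv)
    return observed / intrinsic


def monte_carlo_fesc(lya_confirmed, lya_candidates, ha_sample,
                     n_realisations=1000, seed=0):
    """Mean and standard deviation of the ratio over many realisations."""
    rng = random.Random(seed)
    draws = [one_realisation(rng, lya_confirmed, lya_candidates, ha_sample)
             for _ in range(n_realisations)]
    return statistics.mean(draws), statistics.stdev(draws)
```

Setting every E_BV entry and its error to zero gives the analogue of the upper-limit check that omits the reddening correction.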

Since the luminosity densities are obtained by integration from zero to infinity, this represents significant
extrapolation in luminosity from the limited range spanned by our survey. To assess the impact of this, we
also sum directly the luminosities of our Lya and Ha emitters and find the fraction of the integrated
luminosity that the survey recovers. We recover 68% of the luminosity in Lya and 65% in Ha and thus, while
the range covered by our survey is limited, around 2/3 of the total luminosity is accounted for purely by
addition of luminosities; simply summing the fluxes of Lya and Ha emitters would give very similar
results.

The selection function for Lya emitters rejects all objects with W_Lya,0 < 20 Å and, if those objects
contribute a large fraction of the total Lya luminosity, that luminosity will go undetected by Lya surveys
that cut at the canonical value. To investigate this we examine the equivalent width distribution of the
Lya galaxies in our sample, first by fitting an exponential function to the distribution as previously done
at redshifts 2 (refs 12, 50) and 3 (ref 4), and second by investigating the cumulative luminosity
distribution in W_Lya. The exponential form of the W_Lya distribution has an e-folding scale of 76 Å, the
same as that observed at z=3.1 (ref 4) and slightly higher than the value of 40-50 Å observed at z=2,
although poor statistics are likely responsible for the discrepancy. Thus between the limits of W_Lya being
independent of Lya luminosity and independent of continuum luminosity density, we miss between 3 and 22%
of the total luminosity. Adopting the narrower z=2 W_Lya distributions, this increases to around 30%.
However, examining the cumulative luminosity distribution as a function of W_Lya, we find that the
underestimate is likely about 20%, which would cause the volumetric escape fraction to increase to around
6%. On the other hand, a very similar result is also found for the Ha emitters: around 20% of the total
luminosity density is missed, and therefore overall it is likely that this effect largely cancels in the
computation of the escape fraction.
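
The two limiting cases for the luminosity lost below the equivalent width cut can be sketched under the assumption of a pure exponential distribution, dN/dW ∝ exp(-W/W0). If the Lya luminosity is independent of W, the lost luminosity fraction equals the number fraction below the cut; if instead the continuum luminosity density is independent of W, then the Lya luminosity scales with W and the lost fraction is W-weighted. The function names are hypothetical.

```python
import math


def missed_fraction_lya_independent(w_cut, w0):
    """Lost fraction when L_Lya does not depend on W (number fraction)."""
    return 1.0 - math.exp(-w_cut / w0)


def missed_fraction_continuum_independent(w_cut, w0):
    """Lost fraction when L_Lya is proportional to W (W-weighted fraction).

    Integral of W*exp(-W/w0) from 0 to w_cut, divided by the same integral
    taken to infinity, which evaluates to 1 - (1 + x)*exp(-x) with x = w_cut/w0.
    """
    x = w_cut / w0
    return 1.0 - (1.0 + x) * math.exp(-x)
```

For a 20 Å cut and a 76 Å e-folding scale these expressions give about 23% and 3% respectively, close to the range quoted in the text.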



Table 2 Schechter parameters of the Lya luminosity function

             log L* [ erg s^-1 ]    log φ* [ Mpc^-3 ]    α
Observed     43.16 ± 0.32           -3.63 ± 0.52         -1.49 ± 0.27
Intrinsic    44.47 ± 0.35           -3.96 ± 0.68         -1.65 ± 0.33
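
For a Schechter function, the luminosity density integrated from zero to infinity has the closed form φ* L* Γ(α + 2), valid for α > -2. The sketch below evaluates this for the best-fit parameters of Table 2; the function name is hypothetical, and the ratio of best-fit values is only illustrative because the quoted escape fraction comes from the full Monte Carlo procedure rather than from the best-fit parameters alone.

```python
import math


def schechter_luminosity_density(log_l_star, log_phi_star, alpha):
    """Integrate L * phi(L) from zero to infinity for a Schechter function.

    Returns phi* L* Gamma(alpha + 2) in erg/s/Mpc^3; requires alpha > -2.
    """
    if alpha <= -2.0:
        raise ValueError("integral diverges for alpha <= -2")
    return 10.0 ** (log_l_star + log_phi_star) * math.gamma(alpha + 2.0)


# Best-fit values from Table 2.
rho_obs = schechter_luminosity_density(43.16, -3.63, -1.49)
rho_int = schechter_luminosity_density(44.47, -3.96, -1.65)
f_esc_best_fit = rho_obs / rho_int
```

The best-fit ratio comes out at roughly 7%, within the uncertainty of the Monte Carlo value of (5.3±3.8)%.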



The radiation transfer models and theoretical Lya escape fractions 

MCLya is the current state-of-the-art code for three-dimensional radiation transfer of
Lya and continuum photons. The implemented physics includes dust scattering and absorption, HI
scattering, and frequency and angular redistribution, all in arbitrary geometries and velocity fields. All our
models are carried out assuming a spherically symmetric, homogeneous and co-spatial shell distribution of
HI and dust with constant density and temperature. Photons are injected centrally, with the shell described
by four physical parameters:

• N_HI [ cm^-2 ] The radial HI column density

• V_exp [ km s^-1 ] The expansion velocity of the shell

• τ_a [ dimensionless ] The radial optical depth due to dust absorption

• b [ km s^-1 ] The Doppler parameter describing the microscopic HI velocity distribution

Model Lya emission lines are generated a posteriori from arbitrary Lya input spectra described by their
full width at half maximum (FWHM_Lya) and W_Lya. f_esc is computed for any model as the ratio of the
transmitted Lya flux to that of the input spectrum. With MCLya we have computed a large and complete
grid of transfer models through shells covering the parameter space: log N_HI ∈ [16–21.7] cm^-2;
V_exp ∈ [0–500] km s^-1; τ_a ∈ [0–5]; b ∈ [10–160] km s^-1. In total the grid consisted of 5,200
combinations of shell parameters with gridpoints approximately logarithmically spaced. Using the escape
fractions of continuum photons near to Lya and the same extinction law as applied to the observations,
τ_a converts to E_B-V ∈ [0–0.36]. To complete the range of models we generate spectra for a range of input
FWHM_Lya ∈ [50–700] km s^-1, giving us f_esc and E_B-V for approximately 50,000 synthetic galaxies.

Figure SI1: The normalised instrumental response curves. The red line shows the HAWK-I/NB2090
("cosmological Ha") filter, re-scaled along the wavelength axis to sample Lya (i.e. the theoretical perfect
filter for Lya). The blue curve shows the throughput of FORS1/NB388, which almost perfectly matches
that defined by the re-scaled NB2090.

Figure SI2: The selection of narrowband excess candidates in colour-magnitude space. The dashed line
corresponds to an equivalent width cut of 20 Å in the z=2.2 restframe. Detections are represented by black
points, candidates by magenta.